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1 Spaces that deserve more space 

Scientists and science students of different fields are very familiar, 
in various degrees of sophistication, with vector spaces. Vectors are 
used to model places, velocities, forces, generators of rotations, elec- 
tric fields, and even quantum states. Vector spaces are among the 
building blocks of classical mechanics, electromagnetism, general re- 
lativity, quantum theory; they constitute therefore an essential part 
of physics and mathematics teaching. 

Physicists also like to repeat, with Newton [1, Liber III, regula I; 
0, § 293], a maxim variously attributed to Ockham or Scotus: 'frustra 
fit per plum quod fieri potest per pauciora' 1|3||- Applied to the connexion 
between mathematics and physics, it says that we should not model 
a physical phenomenon by means of a mathematical object that has 
more structure than the phenomenon itself. But this maxim is some- 
times forsaken in the case of vector spaces, for they are sometimes 
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used where other spaces, having a different or less structure, would 
suffice. A modern example is given by quantum theory, where 'pure 
states' are usually represented as (complex) vectors; but the vectors 
^) and Alp, A 7^ 0, represent the same state, and the null vector rep- 
resents none. Clearly the vector-space structure is redundant here. 
In fact, pure quantum states should more precisely be seen as points 
in a complex projective space § 1.3. 1;@]. 

Another example is the notion of reference frame in classical gal- 
ileian-relativistic mechanics: such a frame is often modelled as a vec- 
tor space, wherein we describe the place occupied by a small body by 
its 'position vector' (with respect to some origin). But suppose that 
I choose two places, e.g. (on a solar-system scale) those occupied by 
Pluto and Charon at a given time, and I ask you: what is the sum 
of these places? This question does not make very much sense; and 
even if you associate two vectors to the two places and then perform 
a formal sum of those vectors, the resulting place is devoid of any 
physical meaning. Thus, even if we usually model places as vectors, 
it is clear that the mathematical structure given by vector addition 
has no physical counterpart in this case. 

On the other hand, I can ask you to determine a place in between 
the places occupied by Pluto and Charon such that its distances from 
the two planets are in an inverse ratio as the planets' masses m-p, mq; 
in other words, their mass centre. You can obtain this place unam- 
biguously, and it also has a physical meaning: it moves as the place 
occupied by a body with mass mp + niQ under the total action of the 
forces acting on Pluto and Charon. It turns out that the operation 
of assigning a mass-centre can be modelled in a space that has less 
structure, and is therefore more general, than a vector space: an affine 
space. 

Affine spaces have geometrically intuitive properties and are not 
more difficult to understand than vector spaces. But they are rarely 
taught to physics students, and when they are, they appear as by- 
products of vector spaces. This is reflected in textbooks of mathem- 
atical methods in physics. Amongst the old and new, widely and 
less widely known textbooks that I checked [6-21J, orJy Bamberg & 
Sternberg [18], Szekeres | |2l]| . and obviously Schouten js] give appro- 
priate space to affine spaces; almost all others do not even mention 
affine spaces at all, although all of them obviously present the theory 
of vector spaces. 

The students who have heard about affine spaces and would like 
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to know more about them will find heterogeneous material, scattered 
for the most part in books and textbooks about general geometry. 
Part of this material has an analytic flavour, part a geometrical fla- 
vour; and to get a more all-round view some patch-work is needed. 
It is the purpose of these notes to offer to the interested students such 
patch-work, emphasizing the dialogue between the analytic and the 
synthetic-geometric presentations, and some references. Knowledge 
of basic Euclidean-geometrical notions is assumed. 

Closely related to affine spaces are convex spaces. These are also 
geometrically very intuitive, and are ubiquitous in convex analysis 
and optimization. Although their range of application in physics 
is maybe narrower than that of affine and vector spaces, they ap- 
pear naturally in probability theory and therefore in statistical phys- 
ics, be it the statistical mechanics of mass-points, fields, or continua; 
and they are of utmost importance in quantum theory, being be- 
hind many of its non-classical properties. Quantum theory is indeed 
only a particular example of general plausibilistic physical theories 
or 'statistical models', and convex spaces are the most apt spaces to 
study the latter. 

Students who have heard about and are interested in the general 
theory of convex spaces will find even less, and more hidden, mater- 
ial than for affine spaces. These notes offer to those students some 
references and a general overview of convex spaces too. 

At the end of these notes I shall give an example of application 
of affine spaces, related to the previous discussion about Pluto and 
Charon. Extended application examples for convex spaces are left to 
a future note. 

2 Affine spaces 

2.1 Analytical point of view 

An affine space is a set of points that is closed under an operation, affine 
combination, mapping pairs of points a, b and pairs of real numbers 
A, fi summing up to one to another point c of the space: 

{a,b,A, ji) I-)- c = AaS iJ.b, A,^eR, A + n = 1. (1) 

The intuitive properties of this operation, including the extension 
to more than two points, I do not list here. It behaves in a way 
similar to scalar multiplication followed by vector addition in a vector 
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Space; but as the symbol 'ffl', used here instead of '+', reminds us, 
'multiplication' of a point by a number and 'sum' of two points are 
undefined operations in an affine space: only the combination above 
makes sense. This operation has a precise geometric meaning which 
will be explained in § 12.31 One usually writes simply Aa + fxb, a 
notation that we shall follow. Whereas a vector space has a special 
vector: the null vector, an affine space has no special points and is 
therefore more general than a vector space. 

A set of points is affinely independent if none of them can be written 
as an affine combination of the others. The maximum number of 
affinely independent points minus one defines the dimension of the 
affine space. An affine basis is a maximal set of affinely independent 
points. Any point of the space can be uniquely written as an affine 
combination of basis points, and the coefficients can be called the 
weights of the point with respect to that basis. A choice of basis 
allows us to baptize each point with a numeric name made of n reals 
summing up to one, where n is the dimension of the space plus 
one. This n-tuple can be represented by a column matrix. An affine 
combination of two or more points corresponds to a sum of their 
matrices multiplied by their respective coefficients. For particular 
affine spaces whose points are already numbers, like the real line R, 
such baptizing ceremonies are usually superfluous. 

Some subsets of an affine space are affine spaces themselves, of 
lower dimensionality. Given two points fli, fl2/ the line flifl2 through 
them is the locus of all points obtained by their affine combinations 
for all choices of coefficients (Ai,/\2), /\i + A2 = 1. Given three af- 
finely independent points fli, fl2/ '^3/ the plane flifl2^3 through them 
is the locus of all points obtained by their affine combinations for 
all choices of coefficients {^1,^2, A3), Y^i^i = 1- And so on for n + 1 
points and ^-dimensional planes, the latter called n-planes for short. 
All these are affine subspaces: a line, of dimension one; a plane, of di- 
mension two; etc. In general, given a finite set of points {fli, . . . , fl^}, 
not necessarily affinely independent, their affine span aff {«!, . . . , a, } is 
the smallest affine subspace containing them. It is simply the locus of 
all points obtained by affine combinations of the {fl,} for all possible 
choices of coefficients. 

Given four points flj, fl2/ ^1/ ^2/ the lines flifl2 and ^'1^2 are said to 
be parallel, written aia2 II bxh^, according to the following definition: 

a\a2 II l}\h2 ^2 = ^1 ~ ^^1 + ^^2 for some A. (2) 
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For two planes flia2«3 arid bi&2^3 to be parallel we must have 

h2 = hi — (A + i^)a\ + /\fl2 + M'^s for some A, /i, 
= bi — {A' + /i')fli + /\'fl2 + m'^^s for some A', ii'. 

And so on. We shall see later that these notions coincide with the 
usual geometrical ones. Geometrically, affine dependence means col- 
linearity, coplanarity, etc. 

2.2 Affine mappings and forms 

An affine mapping or affinity from one affine space to another or to 
itself is a mapping F that preserves affine combinations: 

= E/A,F(fl,), E,^,— 1; (4) 

it therefore maps r-planes into t-planes, where t ^ r, and mutually 
parallel objects into mutually parallel objects. In the following we 
shall often use the summation convention and omit the normaliza- 
tion condition when clear from the context. 

Introducing two affine bases in the domain and range of an affine 
mapping it is easy to see that it can be represented by a stochastic 
{m + l,n + l)-matrix (i.e., with columns summing up to one), operat- 
ing on a point through multiplication by the latter 's column matrix; 
n and m are the dimensions of domain and range. This represent- 
ation is basis-dependent. The rank of the matrix, which is basis- 
independent (and obviously smaller than n + 1 and m + 1), is equal 
to the dimension of the image of the domain plus one. When do- 
main, range, and the image of the domain have the same dimension 
the matrix is square and its non-vanishing determinant, also basis- 
independent, is the ratio of the hypervolumes determined by the im- 
age of the first space's basis and that formed by the second space's 
basis; more on ratio of hypervolumes in § 12.31 

We can define affine combinations of the affinities between two 
affine spaces in a canonical way: (AF+ jj.G){a) := AF{a) + /iG(fl) 
for any two affinities F, G with same n-dimensional domain and m- 
dimensional range (note how the expression 'AF(fl)' by itself has no 
meaning). The set of these mappings is therefore an affine space 
itself, of dimension m{n + l). 
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An affinity from an n-dimensional affine space to the real line 
R can be represented by a single-row matrix with n entries, instead 
of a (2, n) matrix, because as already said the points of the reals 
can numerically represent themselves without the need of an affine 
basis. The matrix representation is still dependent on a choice of 
basis in the domain affine space, though. Such affinities are called 
affine forms or simply /orms. Their set is an affine space; in fact, it 
is even a vector space owing to the vector structure of the reals; its 
dimension in both cases is n + 1, thus larger than that of the original 
affine space (for this reason 1 find the name 'dual space', used by 
some, inappropriate). The action of an affine form on a point a will 
be denoted by v • a. 

Choose a basis (e') in an affine space. In the space of forms, seen 
as a vector space, we can then choose a vector basis {dj) such that 
dj ■ e^ = b'j-, the dj are called dual forms of the basis (e'). The set {dj} 
is however insufficient as a basis if we see the space of forms as an 
affine space: it has to be augmented by another form, like the null- 
form dQ-. a or the unit-form d^- a ^ 1. Here we choose the 
former; (do, dj) is thus an affine basis in the space of forms. 

A non-constant affine form can be geometrically seen as a fam- 
ily of parallel hyperplanes in the affine space: the form has a con- 
stant value on each hyperplane. This family is usually iconized by 
drawing only two hyperplanes: one, unmarked, where the form 
has value zero, and one, marked by e.g. a tick, where it has value 
one. A constant form has no such hyperplanes of course. The r- 
multiple of a form has its unity hyperplane at a distance 1/r times 
the original distance from the zero hj^erplane. If you wonder what 




Figure 1: Scalar multiplication and addition of affine forms 
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I mean with 'distance', given that no such notion is defined in an 
affine space, please read the next section. The sum of two forms is 
a form whose unity hyperplane passes through the intersections of 
their zero and unity hyperplanes, and whose parallel zero hyper- 
plane passes through the intersection of their zero hyperplanes. See 
fig- ID and the nice illustrations in Burke | | 22:,23,] . 

2.3 Geometrical point of view. Translations. Ratios of 
n-volumes 

From a geometrical point of view, an affine space is based on the 
notions of point, line, plane, space, hyperplane, an so on, and the 
notion of (Euclidean) parallelism. I shall take these notions, that can 
be axiomatized in many different ways, for granted. Note that the 
notions of distance and angle are undefined. 

Affine mappings between affine spaces are those that preserve the 
relation of parallelism: they map pairs of parallel objects, like lines or 
hyperplanes, into pairs of parallel objects. A special group of affine 
transformations of an affine space into itself are those that map every 
object into another parallel to, and of the same dimension as, the ori- 
ginal one. They are called translations. To specify a translation u we 
only need to assign a point a and its image a' := u{a). The image 
h' := u{b) of any other point b outside of the line aa' is determined by 
requiring that the line hh' be parallel to aa' and the line a'b' to ab, as 
in fig. |2] (the segment bb' can then be used to construct the image of 



Figure 2: Construction of the image b' — u{b) of b by the translation 
determined by a and a' — u(a) 



lb 
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other points on the line aa'; from this construction it is clear that the 
case of a one-dimensional affine space requires a different approach). 
The translations clearly form a commutative group, the identity be- 
ing the null translation Q . a ^ a, and the inverse of u being the 
translation —u determined by u{a) and its image a = —u[u{a)\. The 
action of this group on the affine space is transitive, faithful, and free. 

If the point fli is the image of a under u, and fl2 the image of 
a under a double application, u' := 2u := u + u := m o u, of the 
same translation, we can say that the oriented segment 002 is twice 
Ofll, or that the latter is half the former, and we can write u = u' /2. 
Generalizing this construction we can define rational multiples of a 
translation, and thence generic real multiples Am, /\ G R, through a 
Dedekind-section-like construction i jzl, § 13.3]. Negative values in- 
dicate a change in orientation. Translations form therefore a vector 
space over the reals, sometimes called the translation space of the ori- 
ginal affine space, and they allow us to speak of the ratio of two 
lengths along parallel lines (but not along non-parallel ones), of two 
areas on parallel planes, and so on with n-areas, up to the ratio of any 
two hypervolumes. The procedure is to divide the first n-area into 
smaller and smaller equal n-rectangles, and to see how many of them 
are in the limit needed to fill, by translation, the second «-area; see 
the example in fig. |3l The ratio between two hypervolumes provides 




Figure 3: The orange and blue areas on the two parallel planes are in 
the ratio 21 /24, as the decomposition into smaller equal rectangles 
shows underneath. With a limiting, similar construction we can com- 
pare parallel areas with curvilinear boundaries 
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a geometric definition of the determinant of an affinity, defined in 
§ 12.21 in terms of the matrix representing the affinity. It should also 
be clear now what we meant at the end of the same section by saying 
that the distance between two hyperplanes is 1 /r times the distance 
between two other hyperplanes, all these hyperplanes being mutu- 
ally parallel: draw any line intersecting all the hyperplanes; then the 
segment intercepted on the line by the first two hyperplanes and that 
intercepted by the last two hyperplanes are in the ratio 1/r. 

The action of a translation u on the point h is usually denoted by 
b + u := u{b) = c'. We also write u = c — b to denote the fact that 
u is uniquely determined by some point b and its image c. Then, 
by what we said above, the translation Am = A{c — b) maps b to 

a point c' such that be' is A times be (negative values indicating a 
change in orientation). The action of the same translation on the 
point a can then be written a + A{c — b). Given another translation 
IJ.V = fi{d — b), the action of the composite translation Au + \xv ovi a 
can be written as a + A{c — b) + \j.{d — b). Generalizing this we obtain 
expressions which are formal sums of affine points with coefficients 
summing up to imity This provides a link between the geometric 
and analytic presentations of an affine space: any affine combination 
/\,fl' can be written and interpreted as the image a + Ai[a^ — a) of an 
arbitrary point a under the composition of the translations Aj{a^ — a), 
and vice versa. Note again that the expression 'a — V does not denote 
a point of the affine space but a particular mapping (translation) onto 
the space. An expression like 'a — b — c' has absolutely no meaning, 
not even in terms of translations. 

A geometric interpretation of the affine combination b = A\ai + 
/\2fl2 is that is a point on the line determined by flj and fl2 ^rid such 
that the unoriented segment fl2^/ i-e- the one fli is not generally an 
endpoint of, is Ai times the segment flifl2, a negative ratio indicating 
that b and oi lie on opposite sides of ^2; ^rid analogously for aib and 
A2. You can prove for yourself that the geometric interpretation of the 
combination b = A\ai + /\2fl2 + -^3^3/ with the fl/ affinely independent, 
is that bis a point in the plane determined by the a,- and such that the 
triangle a2a3b, i.e. the one fli is not generally a vertex of, is Ai times 
the triangle flifl2fl3/ the ratio being negative if b and lie on opposite 
sides of the line 32^3; and analogously for the other triangles with b 
as a vertex and the other coefficients; see fig. IH 

Note again that lengths, areas, etc. belonging to non-parallel sub- 
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fl3 

A2 . A'l 



fl2 



Figure 4: Geometric meaning of the affine combination h = Aifli + 
/\2fl2 + '^s^Js: the ratios of the triangles 02^3^/ ^1^3^/ and flifl2^ to 0102^3 
are |/\2|, and \?^^\. The coefficient of fli is negative: A\ < 0, 
because b and fli he on opposite sides of the line ^2^3 



spaces cannot be directly compared. For that purpose one can use 
affine forms, two-forms, twisted forms, etc., which however will not 
be discussed in this note. For those I refer the reader to the books by 
Burke 112; HI, Bossavit 10; IH], and also Schouten H. 



2.4 References 

Excellent analytic and geometric introductions to affine spaces and 
mappings can be found in ch. 13 of Coxeter llz^l , ch. II of Artin (ztIi , 
§ I.l of Burke |22], and also in chs I-III of Schouten |@] and in Boehm 
& Prautzsch 128J. 
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3 Convex spaces 

3.1 Convex combinations and mixture spaces 

A convex space is analytically defined as a set of points wtdch is 
closed under the operation of convex combination, mapping pairs of 
points and pairs of non-negative real numbers summing up to one 
to another point of the space: 

{a,b,A,tx) ^ c = Aa + fib, A,^e[0, 1], \+fx = l. (5) 

This operation satisfies additional properties, and their analysis is 
interesting: Three of them. 



define a mixture space. To define a convex space, which is less general 
than a mixture space, we need two additional properties: 

I— > Afl + (1 — /\ )fo is injective for all A G ]0, 1 [ and all a, (6d) 

and 



Convex spaces are special amongst mixture spaces because they can 
always be represented as convex subsets of some affine space; this 
property does not need to hold for a generic mixture space. All 
such representations of a convex space are isomorphic to one an- 
other, and their affine spans are also isomorphic. This allows us to 
rewrite expressions like l|5ll as Afl + ^fc and to interpret them in the 
affine sense it also allows us to speak of the dimension of a con- 
vex space, defined as the dimension of the affine span of any of its 
representations, and to speak of other notions like parallelism and 
compactness. From now on we shall only consider convex rather 
than mixture spaces, and finite-dimensional, compact convex spaces 
in particular. See fig. |5] for some examples of equivalent and inequi- 
valent convex spaces. 



la + Qb = a, 
IJ.b + Aa = Afl-j- \ib, 
Ai[Afl + {1-A)b] + {1- ii)b = Aiia + {l - ^^l)b, 



(6a) 
(6b) 
(6c) 




for all A,iJ.e [0, 1] with Aii^l. (6e) 
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Figure 5: The two upper quadrilateral figures are the same convex 
space, whereas the lower, dashed, darker quadrilateral one is a dif- 
ferent convex space; analogously for the three rounded figures 

A set of points is convexly indq^endent if none of them can be writ- 
ten as a convex combination of the others. The extreme points of a con- 
vex space are the convexly independent points that convexly span the 
whole convex space (their set can be empty for non-compact convex 
spaces). A point can generally be written as a convex combination 
of extreme points in more than one way, so we cannot use them as 
a 'convex basis' to assign unambiguous numeric names to the other 
points (one can select a unique convex combination through addi- 
tional requirements, e.g. that its weights have maximum Shannon en- 
tropy). See fig.m But through the representation of the convex space 
in an affine space we can introduce an affine basis, whose elements 
can lie outside the convex space, and write every point of the convex 
space uniquely as an affine combination of these basis elements; this 
affine combination will not in general be a convex combination, i.e. 
its weights can be strictly negative or greater than unity. A weight 
lying in [0, 1] will be called proper, otherwise improper. 

A simplex is a convex space with a number of extreme points 
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Figure 6: Example of a convex set. The points fli, fl2/ ^3/ ^4 are ex- 
treme points of the set, as well as all points on the (thicker) curved 
part of the boundary. The point h can be written as a convex combin- 
ation of extreme points in at least two different ways: as ai/2-\- 
or as 02/3 + 2^4/3 

exceeding its dimension by one. The extreme-point decomposition 
of a point of a simplex is always unique, hence a simplex' extreme 
points constitute a canonical affine basis. A parallelotope is a convex 
space whose facets are pairwise parallel; it can be represented as a 
hypercube. See the right side of fig. |8] for the two-dimensional case. 

3.2 Plausibilistic forms 

We can consider mappings from a convex space to another that pre- 
serve convex combinations. When the range is the real numbers, 
we can speak of an affine form, since such a convex mapping can be 
uniquely extended to an affine form on the affine span of the convex 
space. This kind of mappings are also defined for a mixture space, 
and properties i6d) and ( |6e] l are equivalent to say that the mixture 
space is separated or non-degenerate, viz, for any pair of points there 
is a form having distinct values on them. In other words, a convex 
space is a mixture space in which each pair of points can be distin- 



A surjective affine mapping from a convex space onto one of 
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equal or lower dimensionality can be called a (parallel) projection. 
An injective affine mapping from a convex space into one of higher 
dimensionality can be called an (affine) embedding. 

Affine forms from a convex space S to the interval [0, 1] are es- 
pecially important. We call them plausibility forms. Convex combin- 
ations of these can be naturally defined; they therefore constitute a 
convex space, which can be given the name of plausibility space of S, 
denoted by P (5): 

V{S) := {v. S ^ [0, l]\vis affine (or convex)}. (7) 

I avoid the name 'dual space' because it risks to become overloaded 
and easily confused with other notions of duality (see e.g. § 3.4 in 
Griinbaum [30]). The action of a plausibility form u on a point a will 
be denoted hy v ■ a (confusion with affine forms on affine spaces is 
not likely to arise); once an affine basis is chosen in the convex space, 
this action can be written as matrix multiplication, as for affine forms. 
The forms i^q : fl i— > and Vi: a ^ 1 are called null-form and unit-form. 

A non-constant plausibility form on a convex space can be geo- 
metrically seen as a family of parallel hyperplanes (in the embedding 
affine space) between two given ones that do not intersect the space's 
interior. On each hyperplane the form has a constant value, with val- 
ues zero and unity on the utmost ones. These hyperplanes are also 
used to iconize the plausibility form, a mark being put on the unit 
one; see fig. [Zl 

Plausibility forms allow us to give this straightforward definition: 
A face of a convex space is a subset on which a plausibility form has 
value 0. A face is a convex space itself because any convex combin- 
ation of its points is still a point of the face. A facet is a face of one 
less dimension than the convex space. 

The convex structure of a plausibility space is determined by that 
of its respective convex space. Its affine span is the space of affine 
forms on the affine span of the original convex space. This means, 
from what we said about dual forms an bases in § 12.21 that a plaus- 
ibility space has one more dimension than the original convex space. 
Two of its extreme points are given by the null-form and the unit- 
form. The number of remaining extreme points is determined by the 
structure of the faces of the original convex space (for example, if 
the convex space is two-dimensional, the number of extreme points 
of its plausibility space is equal to 2m + 2, where m is the number 
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Figure 7: On the left, c is a plausibility form for the five-sided convex 
space; two lines are indicated where the form has values 1/2 and 2/3 
on the space. On the right, w cannot be a plausibility form (although 
it is an affine form) because it assigns strictly negative values to some 
points of the convex space 



of bounding directions of the convex space). See fig. |8] for two two- 
dimensional examples. The outcome space of an n-simplex is an 
(n + l)-parallelotope (which has 2"+^ extreme points). 



3.3 References 

Books on or touching convex spaces are Griinbaum's ijso'], Valentine's 



1 3111 , Alfsen's [32], Brondsted's [33]. Studies and examples of the dif- 
ference between mixture and convex spaces are presented by Mongin 
[ |29l[ and Wakker |3^ § V11.2]. Other examples, axiomatizations, and 
applications can be found in refs jH-Sl]. See also refs [|43-l55l[ for 
related topics. 

Infinite-dimensional convex spaces are less intuitive and require 
care in their study. The studies of Klee and others jsB-tsSl very 
interesting and provide appropriate references. 



4 An application: reference frames in classical 
mechanics 

In the introduction we hinted at the fact that in classical galileian- 
relativistic mechanics places are often represented by 'position vec- 



15 



Porta Mana 



Affine and convex spaces 




Figure 8: Two two-dimensional convex spaces, on the left, with their 
three-dimensional outcome spaces, on the right. The plausibilistic 
forms 10, u are represented as pairs of parallel lines on the convex 
spaces and as points on the outcome spaces. Vq and Vi are the null- 
and unit-forms. 
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tors' though they need not be modelled by vectors at all; in fact some 
operations that we can do with vectors, e.g. sums, do not have any 
physically meaningful counterpart for places. Places can instead be 
modelled by a Euclidean space, which is a particular example of af- 
fine space, one in which the additional notions of distance and angle 
are defined. Velocities, accelerations, forces maintain their vectorial 
character nevertheless. This is done as follows in the special case of 
point-mass mechanics: 

We assume as primitives the notions of point-mass, time, time 
lapse (i.e. a metric on the time manifold), and distance between 
any pair of point masses at each time instant. We postulate that, 
at each time instant, the net of distances among all point masses 
has a three-dimensional Euclidean character (e.g., theorems concern- 
ing triangles equalities and triangle inequalities are satisfied). This 
net of distances determines precise affine relations among the point 
masses; these relations are of course variable with time like the dis- 
tances themselves. The point masses can therefore be made to span 
a three-dimensional affine space at each time instant. The points 
of this affine space are what we call places, and each place is de- 
termined, in many equivalent ways, by an affine combination of the 
point masses. For example, at an instant t the affine combination 
ai(t)/4 — 3a2{t)/A + 6fl3(f)/4 determines a unique place b in terms 
of the point masses ai{t) , a2{t) , a^{t) . Different affine combinations 
can determine the same point: e.g., if fl4(i) = '2.a^{t) — a2{t) at t, then 
b is equivalently given by fli(t)/4 + 3fl4(f)/4. Note that, once the 
affine relations among the point masses are given, we do not need 
a notion of absolute distance to determine b, nor the ability to com- 
pare distances along unparallel directions; i.e. we do not need the 
Euclidean structure. 

At another time instant t' the mutual distances and affine re- 
lations between the point masses will be different; we may have 
e.g. a4{t') 7^ 2a^{t') — a2{t'). So it does not make sense to try to 
identify at t' the place b that we defined at t: should it be given 
by fli(f')/4 - 3fl2(f')/4 + 6fl3(t')/4? or by fli(f')/4 + 3fl4(f')/4? — 
the two combinations are inequivalent now. In other words, there is 
no canonical identification between the whole affine (and Euclidean) 
spaces at two different instants of time. This also means that there is 
no 'absolute space'. See fig. |9l 

But absence of a canonical identification does not mean that no 
identification at all is possible. A frame of reference is a particular. 
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Figure 9: The Euclidean net of distances among the point masses 
ai,a2,aj,,a.i determines an affine space at each time instant, e.g. 
and fj. The point masses can be identified at each instant, but a 
generic place hi determined at by a particular affine combination of 
the point masses has no counterpart at ti because the affine relations 
among the point masses have changed. 



arbitrary identification of the places of the affine spaces at any two 
instants of time, respecting the affine and Euclidean structure; i.e., it 
is a mapping, defined for any two instants t and f', 

Ft,y.At,^At, (8) 

between the Euclidean-affine spaces A\, At spanned by the point 
masses at those two instants, that preserves distances. It therefore 
preserves affine combinations: 

Ft,,,(Afl' + ^h') = AFt,^t{a') + ^Ft,,t{h'). (9) 

A frame of reference allows us to say that a particular place at time 
t is the 'same' as some place at time t' , so that we can use only one 
affine space for all times and we can say that a particular point mass 
'moved' from a place at t to another place at t' . See fig. 1101 

The velocity of a point mass' motion at the instant in a partic- 
ular frame F is defined as 

v{t,) := lim (10) 
t^lo t — to 
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Figure 10: K frame of reference is an arbitrary isomorphism between 
the places of the Euclidean-affine spaces at any two times. With 
respect to the mapping above we can say, e.g., that the point mass 
a\ occupies the same place at and t\, while the other point masses 
change place. Note, however, that the physical situation at t\ (and to) 
in this figure and fig. |9]is exactly the same. 



p(t) being the place occupied by the point mass at time t. The argu- 
ment of the limit is, for each t, the 'difference' between two points 
in the affine space associated to the instant t^: it is namely a transla- 
tion, as discussed in § 12.31 and therefore a vector. The limit is hence 
a vector, too. In this way we obtain the vectorial character of veloci- 
ties, accelerations, and in a similar way of forces, without the need to 
model places as vectors. Note again that only the affine structure of 
space enters in the expression above, not the Euclidean one (but we 
have a metric on the one-dimensional manifold that models time, as 
implied by the denominator of the fraction). 

This way of modelling space in classical mechanics is based upon 
and combines the works of Noll jsi; 82], Truesdell [83], and Zan- 
stra [84.-8 7.]. Apart from mathematical economy, it has the pedagogic 
advantage of presenting galileian-relativistic mechanics in a fashion 
closer to that of general relativity: in general relativity the set of 
events is a manifold that cannot be modelled as a (four-dimensional) 
vector space. Only (four-)velocities, accelerations, momenta have a 
vectorial character. 
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